Skip to content

【25-Q3-生态建设】模型迁移-研发效能部-模型训练-在PyTorch框架上支持continual-learning在MNIST数据集上的训练#441

Open
Hyun-hu wants to merge 1 commit intoTecorigin:mainfrom
Hyun-hu:contrib/hyun/continual-learning
Open

【25-Q3-生态建设】模型迁移-研发效能部-模型训练-在PyTorch框架上支持continual-learning在MNIST数据集上的训练#441
Hyun-hu wants to merge 1 commit intoTecorigin:mainfrom
Hyun-hu:contrib/hyun/continual-learning

Conversation

@Hyun-hu
Copy link

@Hyun-hu Hyun-hu commented Dec 22, 2025

软件栈版本
--------------+----------------------------------------------
Host IP | 127.0.1.1
PyTorch | 2.4.0a0+git4451b0e
Torch-SDAA | 2.0.0
--------------+----------------------------------------------
SDAA Driver | 2.2.0b2 (N/A)
SDAA Runtime | 2.0.0 (/opt/tecoai/lib64/libsdaart.so)
SDPTI | 1.3.1 (/opt/tecoai/lib64/libsdpti.so)
TecoDNN | 2.0.0 (/opt/tecoai/lib64/libtecodnn.so)
TecoBLAS | 2.0.0 (/opt/tecoai/lib64/libtecoblas.so)
CustomDNN | 1.22.0 (/opt/tecoai/lib64/libtecodnn_ext.so)
TecoRAND | 1.8.0 (/opt/tecoai/lib64/libtecorand.so)
TCCL | 1.21.0 (/opt/tecoai/lib64/libtccl.so)
--------------+----------------------------------------------
工作目录:PyTorch/contrib/other/continual-learning
适配内容:使用1张TECO_AICARD_01芯片,在PyTorch框架上支持continual-learning在MNIST数据集上的训练
运行脚本见模型readme
loss
MeanRelativeError: -0.121553406
MeanAbsoluteError: 0.00038663193
Rule,mean_relative_error -0.121553406
pass mean_relative_error=-0.121553406 <= 0.05 or mean_absolute_error=0.00038663193 <= 0.0002

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant